3 <110> APPLICANT: KANG, Hyun-Ah

4 RHEE, Sang-Ki 

5 SOHN, Min-Jeong 

6 KIM, Jeong-Yoon 

8 <120> TITLE OF INVENTION: HANSENULA POLYMORPHA YAPSIN DEFICIENT MUTANT STRAIN AND PROCESS

9 FOR THE PREPARATION OF THE RECOMBINANT PROTEINS USING THE SAME
11 <130> FILE REFERENCE: P26459 

13 <140> CURRENT APPLICATION NUMBER: US 10/518,377 

14 <141> CURRENT FILING DATE: 2004-12-28 

16 <150> PRIOR APPLICATION NUMBER: PCT/KR2003/001279 

17 <151> PRIOR FILING DATE: 2003-06-28 
19 <160> NUMBER OF SEQ ID NOS : 16 

21 <170> SOFTWARE: PatentIn version 3.3

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 3151 

25 <212> TYPE: DNA 

26 <213> ORGANISM: Hansenula polymorpha 

28 <400> SEQUENCE: 1 

29 agttgagtcg caatagtgtg gcgaacttca aatgccctta ctgtccgcga acaaccacca 60 
31 ttgcccaggc tgtgcaggcc agatttgtta atttgtgaaa agtggaaaaa atttattccg 120 
33 ctatgcctaa ccgaagagcc cgcaagaaga ggcggacaga agacttttcc agctcttcgg 180 
35 catctgaaaa cgatagtgac tccgagagcg tgaccagtgt acaggaagag cagccggatg 240 
37 cgcccgaaac atacacaata gatggcctgg acacgcaaga ggtgtctgac agcacacagg 300
39 tgagactcca acagctgaac gcagacaggt tggccagcat agagcaaagc ctttcaggca 360 
41 acctcaaact ggacataaac gcagtacgcc agatagatga tgtgcgtgag cagctgcaga 420 
43 acgagtattt gaagaaattg cttgtcacat attctgagga cctggatgcg ctgcgtcaga 480 
45 aaaccgattt caaggaaaac tcactcaaaa ccctcgcccg tcttctcaaa gagagcggaa 540 
47 acatatttga tgatggaact ctcaagtcgc tagttgagtg atgtatatga taatgtctaa 600 
49 ttttaatttt catcagtgtg caagatctgg gcttagccgt tctaaatggt atattcaggc 660 
51 tgtgcaagcc acatttaaaa ttaccccatc ggtttttaaa ttctattgtt agaaattagg 720 
53 atctacatag aggtagagtg agcaacagaa cattgtttgc tatccgggcc ctccgactgg 780 
55 aacgtcttac cttcagctac tatttattca gaaaaaagag tgcattttca tctatcaagg 840 
57 tctcaaagtg tcgaatcaaa tcactagtat tttttccgag actaaaaaaa agttgacaca 900 
59 atgaaagttg ctacactgtt tttcttggct tcgagtgtct gtgtgctggg agacccacag 960 
61 ttcgtgaaac tggaggcctc tgttcttcgg ggatccactt acaaggattc ccagaagggg 1020 
63 gccaagccgt tcatgttgga aaagagggct gatgacggct cggtcacgat ggaattgcag 1080 
65 aacgcccagt ctttctacca agtcgagatc gagataggat ctgataagca gaaggtgggg 1140 
67 gttttgattg ataccggttc ctcggacttg tgggtgatga actcgaataa ctcttactgt 1200 
69 tcgtcttcca gcactaaaaa attgaaacgg gacggaccgg ccgatgcgct acaaaaagga 1260 
71 cgcgatcttt ccgacctgta caatttcaac tctccaaacg aagacaacaa tgcaaaagga 1320 
73 ttcttgggtg gctggggaga cttgaccaca gtagagactg caacccagga tgagacacag 1380
75 acggctctcg ctgcgcaggc caccgtggac tgctcgctat acggaacgtt caatccttca 1440 
77 acgtccaatt cgttccacaa caacggcacc acatttgaga tttcgtacgc ggaccgcact 1500 
79 tttgcccgtg gaacctgggg ctacgatgat gtcactttca atggtgtcac ggttaacgat 1560
81 ctctcgttgg ccgtggcaga tgaaacagat tcttcgactg gtgtttttgg tatcggattg 1620 
83 agggaattgg aaaccacata ctcaggaggc ggaccacagc attacatcta cgacaactta 1680 
85 cctttcaaaa tggtcgacca gggactcatc aatagagccg cctattccgt ctacctgaac 1740 
87 tcaactgagt ccagcactgc ctcgatcctc ttcggtgcgg ttgaccaaag caaatatacc 1800 
89 ggaagtcttg gcttgcttcc tatcatcaac acggctgctt cctacggtta ccaaaagcct 1860 
91 ctaaggctcc aaatcaccct gtctgccatt acggtcagcg actccagagg acagcaagca 1920 
93 agcattggtt caggagctgc tgctgcactt cttgataccg gaacgacttt gacgtatgct 1980 
95 ccaagcgaga ttgtcgagaa acttgctgaa accctaggct tcgactacag cagctctgtc 2040 
97 ggggcctacg tggcaagatg cagggacgtt gatagctacg ctgtcaactt cgacttccag 2100 
99 ggtaaagtga ttgaagctcc tttgagttcc ttcctgattg ctctgcaaac caactccgga 2160 
101 gaagtttcct cctactgcgc attgggtatt ttctcctctg gagacgaatc cttcacgctc 2220 
103 ggcgatactt tcctgcgaaa cgcctacttt gtggctgacc tcgagggata tcaaatcgct 2280 
105 atagctaacg tgaacctgaa tcctggagcc gagcaaattg aggtcatctc aggcaactcc 2340 
107 attccttctg cttcgtcggt ttccgattac tccaatacct ggggcgcctc tgccaccgct 2400 
109 ttggacactg acaggcctac tactctggga tctgtgactg ctgtgggcga tgaaagagtg 2460 
111 acctcgacca agaaggtttc gagtgtgaag acaagcactt cgtccgggtc cgggtccact 2520 
113 tcggagtcgt ctacgtccag ttcgcattcc agcaatggcc caaggacagt aggctttagt 2580 
115 ttgtgtgccg ttttgtgcgc attcttgatt tctatactag ttgtttgcta gatctgaagt 2640 
117 tctaaggggc tttagtcttc atttatgatt tttttttatt tggaccgcct cgaattgttt 2700 
119 ttccgacggg tctactttaa agctgcaaga tctcgtttag cgtcgtttat ttctcgttcg 2760 
121 tttagtgaca aaaaaacaga aaaaaaaact ataaaaagcg gtatataacc tttatatttt 2820 
123 gataaacatg agcagcgaaa ttaagctagc accaaaggat tacgagaagg acaaggagtt 2880 
125 cgccaaggct ctgcatggca aggacgccgc gagcgctaca ggaatgagtg cttgggtgaa 2940
127 gaaggacaag gaagctcaaa aagtcgcgat ggaaggatat ttcaagcact gggacgggaa 3000
129 aaccgacgag gagactgaaa agtcgagact cgaggactac tcgacgctca ccaagcacta 3060
131 ctacaacctg gtgacggatt tctacgagta tggatgggga tcctcgttcc acttttccag 3120 
133 atactacaag ggagagccat ttagacaagc t 3151 

136 <210> SEQ ID NO: 2 

137 <211> LENGTH: 576 

138 <212> TYPE: PRT

139 <213> ORGANISM: Hansenula polymorpha
141 <400> SEQUENCE: 2 

143 Met Lys Val Ala Thr Leu Phe Phe Leu Ala Ser Ser Val Cys Val Leu 

144 1 5 10 15

147 Gly Asp Pro Gln Phe Val Lys Leu Glu Ala Ser Val Leu Arg Gly Ser

148 20 25 30 

151 Thr Tyr Lys Asp Ser Gln Lys Gly Ala Lys Pro Phe Met Leu Glu Lys

152 35 40 45 

155 Arg Ala Asp Asp Gly Ser Val Thr Met Glu Leu Gln Asn Ala Gln Ser

156 50 55 60 

159 Phe Tyr Gln Val Glu Ile Glu Ile Gly Ser Asp Lys Gln Lys Val Gly

160 65 70 75 80 

163 Val Leu Ile Asp Thr Gly Ser Ser Asp Leu Trp Val Met Asn Ser Asn

164 85 90 95 

167 Asn Ser Tyr Cys Ser Ser Ser Ser Thr Lys Lys Leu Lys Arg Asp Gly 

168 100 105 110 

171 Pro Ala Asp Ala Leu Gln Lys Gly Arg Asp Leu Ser Asp Leu Tyr Asn

172 115 120 125 
272 515 520 525

275 Val Lys Thr Ser Thr Ser Ser Gly Ser Gly Ser Thr Ser Glu Ser Ser 

276 530 535 540 

279 Thr Ser Ser Ser His Ser Ser Asn Gly Pro Arg Thr Val Gly Phe Ser 

280 545 550 555 560 

283 Leu Cys Ala Val Leu Cys Ala Phe Leu Ile Ser Ile Leu Val Val Cys

284 565 570 575 

287 <210> SEQ ID NO: 3 

288 <211> LENGTH: 25 

289 <212> TYPE: DNA 

290 <213> ORGANISM: Artificial Sequence 

292 <220> FEATURE: 

293 <223> OTHER INFORMATION: primer 
295 <400> SEQUENCE: 3 

296 gaagtgcagc agcagctcct gaacc 25

299 <210> SEQ ID NO: 4 

300 <211> LENGTH: 26 

301 <212> TYPE: DNA 

302 <213> ORGANISM: Artificial Sequence 

304 <220> FEATURE: 

305 <223> OTHER INFORMATION: primer 

307 <400> SEQUENCE: 4 

308 ggctgatgac ggctcggtca cgatgg 26 

311 <210> SEQ ID NO: 5 

312 <211> LENGTH: 20 

313 <212> TYPE: DNA 

314 <213> ORGANISM: Artificial Sequence 

316 <220> FEATURE: 

317 <223> OTHER INFORMATION: primer 
319 <400> SEQUENCE: 5

320 ggacacgcaa gaggtgtctg 20

323 <210> SEQ ID NO: 6 

324 <211> LENGTH: 40 

325 <212> TYPE: DNA 

326 <213> ORGANISM: Artificial Sequence 

328 <220> FEATURE: 

329 <223> OTHER INFORMATION: primer 

331 <400> SEQUENCE: 6 

332 agctcgctac ccggggatcc gcaactttca ttgtgtcaac 40 

335 <210> SEQ ID NO: 7 

336 <211> LENGTH: 40 

337 <212> TYPE: DNA 

338 <213> ORGANISM: Artificial Sequence 

340 <220> FEATURE: 

341 <223> OTHER INFORMATION: primer 

343 <400> SEQUENCE: 7 

344 gcacatcccc ctttcgccag cctcttcggt gcggttgacc 40 

347 <210> SEQ ID NO: 8 

348 <211> LENGTH: 20 
349 <212> TYPE: DNA

350 <213> ORGANISM: Artificial Sequence 

352 <220> FEATURE: 

353 <223> OTHER INFORMATION: primer 

355 <400> SEQUENCE: 8 

356 gctcggctcc aggattcagg 20 

359 <210> SEQ ID NO: 9 

360 <211> LENGTH: 20 

361 <212> TYPE: DNA 

362 <213> ORGANISM: Artificial Sequence 

364 <220> FEATURE: 

365 <223> OTHER INFORMATION: primer 

367 <400> SEQUENCE: 9 

368 ggatccccgg gtaccgagct 20

371 <210> SEQ ID NO: 10 

372 <211> LENGTH: 20 
373 <212> TYPE: DNA

374 <213> ORGANISM: Artificial Sequence 

376 <220> FEATURE: 

377 <223> OTHER INFORMATION: primer 

379 <400> SEQUENCE: 10 

380 caccggtagc taatgatccc 20 

383 <210> SEQ ID NO: 11 

384 <211> LENGTH: 20 
385 <212> TYPE: DNA

386 <213> ORGANISM: Artificial Sequence 

388 <220> FEATURE: 

389 <223> OTHER INFORMATION: primer 
391 <400> SEQUENCE: 11 

392 cgaacatcca agtgggccga 20

395 <210> SEQ ID NO: 12 

396 <211> LENGTH: 20 
397 <212> TYPE: DNA

398 <213> ORGANISM: Artificial Sequence 

400 <220> FEATURE: 

401 <223> OTHER INFORMATION: primer 

403 <400> SEQUENCE: 12 

404 ctggcgaaag ggggatgtgc 20

407 <210> SEQ ID NO: 13 

408 <211> LENGTH: 24 

409 <212> TYPE: DNA 

410 <213> ORGANISM: Artificial Sequence 

412 <220> FEATURE: 

413 <223> OTHER INFORMATION: primer 

415 <400> SEQUENCE: 13 

416 gaattcatga agtgggtaac cttt 24 

419 <210> SEQ ID NO: 14 

420 <211> LENGTH: 20 

421 <212> TYPE: DNA 